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FIGURE 9 

1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT 
5 1 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA 
101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 
1 5 1 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA 
20 1 TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA TTATAAAG AT 
25 1 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG 
301 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT 
35 1 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 
401 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 
451 ACTTAAGGGA AATCAATTAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 
50 1 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG 
55 1 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA 
60 1 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG 
651 ATCATTTGAA AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 
70 1 GCAGACATGA ATGA ACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 
75 1 GAATGAGAAA GATCACTTTC TCAAC AAC AT CAACGTGCCG AATTGGAATA 
801 ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT 
85 1 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
95 1 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG 
1001 CTTGAGAAAG TCAAAGATTT TAACTTC AAC TACTATTTAA C AAAATCTTG 
1 05 1 TCCTCTTCCA GAAAATTGGC GGGAACGG AA ACAAAAAATC GAAAACTTGA 
1 101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC 
1151 TACACAACTG ATAATAAATG CGTC ACAC AA TTTATTAATG AATTTTTCTA 
1 20 1 C AATATACTC CCCAAAGACT TTTTG ACTGG AAGAAACCGT AAGAATTTTC 
1251 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 
1 30 1 AAAAACTTAT TGCTTGAGAA GATCA ATACA AGAGAAATAT CATGGATGCA 
1351 GGTTGAGACC TCTGCAAAGC A II 1 II ATT A TTTTG ATCAC GAAAACATCT 
1 40 1 ACGTCTTATG G AAATTGCTC CG ATGG ATAT TCGAGGATCT CGTCGTCTCG 
1 45 1 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA 
1 501 AACCTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA 
1551 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 
1 60 1 GAAGAATGGA AAAAGTCGCT TGG ATTTGCA CCTGGAAAAC TCAG ACTAAT 
1 65 1 ACCGAAGAAA ACTACTTTCC GTCCAATTAT G ACTTTCAAT AAGAAGATTG 
1 701 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG 
1751 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC 
1801 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 
1 85 1 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 
1901 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 
195 1 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 
200 1 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC 
205 1 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 
2101 TGCACTTG A A GG AGG ACAAT ATCC A ACCTT ATTCAGTGTT CTTGA AAATG 
2151 AACAAAATGA CTTAAATGCA AAGAA AACAT TAATTGTTGA AGCAAAGCAA 
220 1 AGAAATTATT TTAAG AAAG A TAACTTACTT CA ACCAGTCA TTAATATTTG 
225 1 CCAATATAAT TACATTAACT TTAATGGGA A GTTTTATAAA CAAACAAAAG 
2301 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 
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2351 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 
2401 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 
2451 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 
2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 
255 1 GACTAGTTTT CC ATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 
2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 
2651 TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 
2701 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT 
2751 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC 
2801 CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 
2851 CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 
2901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC 
295 1 TTAG AGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGC AT TCTTTAAATA 
3001 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG 
^ 305 1 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA 

. n 3101 AAGTACATTT TCAACAGAGT TTGCATGATC CTCAAGGCAA AAGAAGCAAA 
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1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKJD 
51 LEDDGFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 
101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 
151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 
201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR 1FYCTHFNRM 
251 NQFFKKHEFV SNKNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM 
301 LEKVKDFNFN YYLTKSCPLP ENWRERKQKJ ENLINKTREE KSKYYEELFS 
351 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH 
401 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLWS 
451 LIRCFFYVTE QQKSYSKTYY YRKN1WDVIM KMSIADLKKE TLAEVQEKEV 
501 EEWKKSLGFA PGKJLRLIPKK TTFRPIMTFN KJCIVNSDRKT TKJLTTNTKLL 
551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 
601 TMDBEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNNIVIDSKN 
651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAKQ 
701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
75 1 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 
801 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI 
851 SIDMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 
901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 
1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A 
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1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 
5 1 GTAGTTTAG A AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA 
101 TTTGGATGAT ATAGAAAATT TACTTCCTAA TACATTCAAC AAGTATAGCA 
1 5 1 GCTCTTGTAG TGACAAGAAA GG ATGCAAAA CATTGAAATC TGGCTCGAAA 
201 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGT TCTAC TT 
251 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 
301 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 
351 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA 
401 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 
451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 
501 GATCTCGATA CATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 
551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 
601 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
651 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 
701 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
75 1 CTCAATTTAA TGGATAGCTA TAG AAACAAA CCAAATAAAC CATGCAAGTT 
801 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 
851 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 
90 1 TTAC AACAGA TTACCTGTTT TG ATTACTCT TGCTCATCTC TTATATCTTT 
951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 
1001 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 
1 05 1 AAAGAAG AAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC 
1 101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT 
1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 
1201 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 
1 25 1 TCAC ATTC AT AG ATCGACCT TCATATATCC AATACGATGA TAAG GAAACA 
1301 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 
1351 AAGAAATGGA GCCG AAATCT TAATCAAAAA GAATTGCGTC GATATTGCAA 
1401 AAGAATCGAA CTCTAAATCT TTCGTTAATA AGTATTACCA ATCTTGATTG 
1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 
1501 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 
1551 CGATCTTC AA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 
1 60 1 ATAC AAACCT TGGTC AAAAT ATTGAGGAAG GAAAAGAAGA CC AGTTAGCA 
1 65 1 AAAGAAA AAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA 
1701 AAGATTTATT II I 1 ICAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 
1751 GGGGTTTTGGGG 
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TTAAAGATTTC AAAAATTCC AGGTAAG AG AG AT AC ATTC ATTAAAATTC ATAT ATT ATAC 
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FIGURE 18 
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FIGURE 19 



1 aactcattla attactaatt taatcaacaa gattgataaa aagcagtaaa taaaacccaa 
61 tagatttaat ttagaaagta tcaattgaaa aatggaaatt gaaaacaact aagcacaata 
121 gccaaaagcc gaaaaattgt ggtgggaact tgaattagag atgcaagaaa accaaaatga 
181 tatataagtt agggttaaga ttgacgatcc taagcaatat ctcgtgaacg tcactgcagc 
24 1 atgtttgttg taggaaggla gttactacta agataaagat gaaagaagat atatcatcac 
301 taaagcactt cttgaggtgg ctgagtctga tcctgagttc atctgctagt tggcagtcta 
361 catccgtaat gaactttaca tcagaactac cactaactac attgtagcat tttgtgttgt 
421 ccacaagaat actcaaccat tcatcgaaaa gtacttcaac aaagcagtac ttttgcctaa 
481 tgacttactg gaagtctgtg aatttgcata ggttctctat atttttgatg caactgaatt 
541 caaaaatttg tatcttgata ggatactttc ataagatatt cgtaaggaac tcactttccg 
601 taagtgttla caaagatgcg tcagaagcaa gttttctgaa ttcaacgaat actaacttgg 
661 taagtattgc actgaatcct aacgtaagaa aacaatgttc cgttacctct cagttaccaa 
721 caagtaaaag tgggattaaa ctaagaagaa gagaaaagag aatctcttaa ccaaacttta 
781 ggcaataaag gaatctgaag ataagtccaa gagagaaact ggagacataa tgaacgttga 
84 1 agatgcaatc aaggctttaa aaccagcagt tatgaagaaa atagccaaga gatagaatgc 
901 catgaagaaa cacatgaagg cacctaaaat tcctaactct accttggaat caaagtactt 
961 gaccttcaag gatctcatta agttctgcca tatttctgag cctaaagaaa gagtctataa 
1021 gatccttggt aaaaaatacc ctaagaccga agaggaatac aaagcagcct ttggtgattc 
1081 tgcatctgca cccttcaatc ctgaattggc tggaaagcgt atgaagattg aaatctctaa 
1141 aacatgggaa aatgaactca gtgcaaaagg caacactgct gaggtttggg ataatttaat 
1201 ttcaagcaat taactcccat atatggccat gttacgtaac ttgtctaaca tcttaaaagc 
1261 cggtgtttca gatactacac actctattgt gatcaacaag atttgtgagc ccaaggccgt 
1321 tgagaactcc aagatgttcc ctcttcaatt ctttagtgcc attgaagctg ttaatgaagc 
1381 agttactaag ggattcaagg ccaagaagag agaaaatatg aatcttaaag gtcaaatcga 
1441 agcagtaaag gaagttgttg aaaaaaccga tgaagagaag aaagatatgg agttggagta 
1501 aaccgaagaa ggagaatttg ttaaagtcaa cgaaggaatt ggcaagcaat acattaactc 
1561 cattgaactt gcaatcaaga tagcagttaa caagaattta gatgaaatca-aaggacacac 
1621 tgcaatcttc tctgatgttt ctggttctat gagtacctca atgtcaggtg gagccaagaa 
1 68 1 gtatggttcc gttcgtactl gtctcgagtg tgcattagtc cttggtttga tggtaaaata 
1 74 1 acgttgtgaa aagtcctcat tctacatcn cagttcacct agttctcaat gcaataagtg 
1 80 1 ttactta'gaa gttgatctcc ctggagacga actccgtcct tctatgtaaa aacttttgca 
1861 agagaaagga aaacttggtg gtggtactga tttcccctat gagtgcattg atgaatggac 
1921 aaagaataaa actcacgtag acaatatcgt tattttgtct gatatgatga ttgcagaagg 
1981 atattcagat atcaatgtta gaggcagttc cattgttaac agcatcaaaa agtacaagga 
204 1 tgaagtaaat cctaacatta aaatctttgc agttgactta gaaggttacg gaaagtgcct 
2101 taatctaggt gatgagttca atgaaaacaa ctacatcaag atattcggta tgagcgattc 
2161 aatcnaaag ttcatttcag ccaagcaagg aggagcaaat atggtcgaag ttatcaaaaa 
222 1 ctttgccctt caaaaaatag gacaaaagtg agtttcttga gattcttcta taacaaaaat 
2281 ctcaccccac ttttttgttt tattgcatag ccattatgaa atttaaatta ttatctattt 
234 1 atttaagtta cttacatagt ttatgtatcg cagtctatta gcctattcaa atgattctgc 
2401 aaagaacaaa aaagattaaa a 




FIGURE 20 

MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL 

\^VTAACLLQEGSYYQDKJDERKYIITKALLEVAESDPEFICQLA\nnRNELYIRTTTN 

YIVAFCWHKNTQPFIEKYFNKAVLLPNDLLEVCEFAQVLYIFDATEFKNLYLDRILS 

QDIRKELTFPJ<CLQRCVRSKPSEFNEYQLGKYCTESQRjCKTMFRYLSVThnCQKWDQTK 

KJ<JIK£NLLTKLQAIK£SEDKSKJIETGDIMN^ 

APKJPNSTLESKYLTFKDLIKJCmSEPKERVYKJLGKJCYPKTEEEYKAAFGDSASAP 

FNPELAGKRMKIEISKTWENELSAKGNTAEVWD>n.ISSNQLPYMAMLRNLSNILKAGV 

SDTTHSIVmKJCEPKAVENSKMFPLQFFSAIEAVNEAVTKGFKAKKRENMNLKGQIE 

A VKE WEKTDEEKKDM ELEQTEEG EF VK VNEG IGKQ YINS I ELAIK1A VNKNLDEIKG 

HTAIFSDVSGSMSTSMSGGAKKYGSVRTCLECALVLGLMVKQRCEKSSFY1FSSPSSQ 

CNKCYLEVDLPGDELRPSMQKLLQEKGKLGGGTDFPYECIDEWTKNKTHVDN1VILSD 

MMIAEGYSDINVRGSSIVNSIKKYKDEVNPNIKIFAVDLEGYGKCLNLGDEFNENNY1 

KIFGMSDSILKFISAKQGGANMVEVIKNFALQKJGQK 



FIGURE 21 

1 tcaatactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 
6 1 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tattacaaaa 
121 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 
1 g 1 ggaggatctc aagcttttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 
241 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 
301 gattaagtag caagtttaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 
361 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta 
42 1 attaagaacg attactgaag aataggttaa gtattaaaat ttagtattta acatggacta 
481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 
54 1 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 
60 1 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 
66 1 tcttaatgta agcattaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 
721 ttcacaaaca atcaaactta ctaataattc ttactagact gttaacatag acgttaattt 
78 1 tgataataat ctctgtatac tcgcattgct tagattttta ttatcactag aaagattcaa 
84 1 tattttgaat ataagatctt cttatacaag aaattaatat aattttgaga aaattggtga 
901 gctacttgaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 
961 acaagttcct tgcgaagcgt tctaatattt agttaactcc tcatcataaa ttagcgttaa 
1 02 1 agatagciaa ttataggtat actctttctc tacagactta aaattagttg acactaacaa 
1081 agtccaagat tattttaagt tcttataaga attccctcgt ttgactcatg taagctagta 
1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 
1201 caagcatgct aatcttaatt tagtttctat ccctacctaa ttcaattttg atttctactt 
1261 tgttaattta taacatttga aattagagtt tggattagaa ccaaatattt tgacaaaaca 
1321 aaagcttgaa aatctacttt tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
1381 aaacttttac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 
1441 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
1501 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 
1561 gcttgaagat ttcagcgtta acttgtaagc tacccaagaa atttatgata gcttgcacaa 
1621 acttttgatt agatcaacaa atttaaagaa gttcaaatta agttacaaat atgaaatgga 
1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 
1 74 1 taaaagatgc tctgttaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa 
1 80 1 taaagattct actttttata aatttaagct gaccttaaac taagaattat aacacgctaa 
1861 gtatactm aagtagaacg aattttaatt taataacgtt aaaagtgcaa aaattgaatc 
1 92 1 ttccFcatta gaaagctlag aagatattga tagtctttgc aaatctattg cttcttgtaa 
1 98 1 aaatttacaa aatgttaata natcgccag tttgctctat cccaacaata tttagaaaaa 
204 1 tcctttcaat aagcccaatc ttctattttt caagcaattt gaataattga aaaatttgga 
2101 aaatgtatct atcaactgta ttcttgatca gcatatactt aattctattt cagaattctt 
2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 
222 1 tcttgattat actaaattat ttaaaacact tcaatagtta cctgaattaa attaagttta 
2281 cattaattag caattagaag aattgactgt gagtgaagta cataagtaag tatgggaaaa 
234 1 ccacaagcaa aaagctttct atgaaccatt atgtgagttt atcaaagaat catcctaaac 
2401 cctttagcta atagattttg accaaaacac tgtaagtgat gactctatta aaaagatttt 
2461 agaatctata tctgagtcta agtatcatca ttatttgaga ttgaacccta gttaatctag 
2521 cagtttaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 
2581 aggtgtttta gtaaaagcat actataaatt ccctctatgt ttaccaactg gtacttatta 
264 1 cgattacaat tcagatagat ggtgattaat taaatattag tttaaataaa tattaaatat 
2701 tgaatatttc tttgcttatt atttgaataa tacatacaat agtcattttt agtgttttga 
276 1 atatatttta gttatttaat tcattatttt aagtaaataa ttatttttca atcatttttt 
2821 aaaaaatcg 
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MSRRNQKKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQIK 
EEDLKLLKFKNQDQDGNSGNDDDDEENNSNKQQELLRJRVNQIKQQVQLIKKVGSKVEK 
DLNLNEDENICKNGLSEQQVK£EQLRTITEEQVKYQNLVFNMDYQLDLNESGGHRRHRR 
ETDYDTEKWFEISHDQK>IYVSIYANQKTSYCWW^ 

FYAFDDFSQTIK1TNNSYQTWIDVNFDNNLCILALLRFLLSLERJFNILNIRSSYTRN 

QYNFEKJGELLETIFAVVFSHRHLQGIHLQVPCEAFQYLVNSSSQISVKDSQLQVYSF 

STDLKJLVDTNKVQDYFKJLQEFPRLTHVSQQAIPVSATNAVENnLhm.LKKVKHANLNL 

VSIPTQFNFDFYFVNLQHLKJLEFGLEPNILTKQKLENLLLSIKQSKNLKFLRLNFYTY 

VAQETSRKQILKQATTIKNLKhfNKNQEETPETKDETPSESTSGMKFFDHLSELTELED 

FSVNLQATQEIYDSLHKLLIRSTNLKXFKLSYKYEMEKSKMDTFIDLKNIYETLNNLK 

RCSVNISNPHGNISYELTNKDSTFYKJKJLTLNQELQHAKYTFKQhffiFQFhWVKSAKIE 

SSSLESLEDlDSLCKSIASCKNLQNVNIIASLLYPhfNIQKNPFNKPNLLFFKQFEQLK 

NLEhA/srNCILDQHILNSISEFLEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQLPE 

LNQVYINQQLEELTVSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDFDQNTVSD 

DSIKKILESISESKYHHYLRLNPSQSSSLIKSENEEIQELLKACDEKGVLVKAYYKFP 

LCLPTGTYYDYNSDRW 
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FIGURE 23 

MKJLFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 

PNSRKJALPCLPGDLSHKAVIDHCUYLLTGELYNNVLTFGYKJARhfEDVNNSLFCHS 

ANVhTVTLLKGAAWKMFHSLVGTYAFVDLLrNYTVIQFNGQFFTQIVGNRCNEPHLPPK 

WVQRSSSSSATAAQIKQLTEPVTNKQFLHKLNINSSSFFPYSKILPSSSSIKKLTDLR 

EAIFPTNLV1GPQRLKVRJNLTLQKLLKRHKRLNYVSILNSICPPLEGTVLDLSHLSR 

QSPKERVLKFIIVILQKiLPQEMFGSKKNKGKJIKNLNLLLSLPLNGYLPFDSLLKJa 

RLKI)FRWLFISDIWFTKJ^WFENLNQLAICFISWLFRQLIPKIIQTFFYCTEISSTVT1 

VWRl^DTW^^<l.ITPFIVEYFKTYLVE^n^VCRNHNSYTLSNFNHSKMRIIPKKS^fNEFR 

IIAIPCRGADEEEFTIYKENHK>IAIQPTQKJLEYLRNKRPTSFTKIYSPTQIADRJKJE 

FKQRLLKJCFNhTVLPELYFMKFDVKSCYDSIPRMECMRJLKDALKNENGFFVRSQYFFN 

TOTGVLKLFKWASRVPKPYELYIDNVRTVHLSNQDVrNVVEMEIFKTALWVEDKCY 

IREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPSQDTLILKLADDFLIISTDQQQV 

INIKJaAMGGFQKYNAK^NRJDKJLAVSSQSDDDTVIQFCAMfflFVKELEVWKHSSTMN 

NFH1RSKSSKGIFRSLIALFNTRJSYKTIDTNLNSTNTVLMQIDHVVKNISECYKSAF 

KDLSrNVTQNMQFHSFLQRllEMTVSGCPITKCDPLIEYEVRFTlLNGFLESLSSNTS 

KFKDN'IILLRKEIQHLQAYIYIYIHIVN 
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FIGURE 24 



Oxytricha LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
Euplotes LCVSS1LSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 
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FIGURE 26 



ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 

GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 

TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 

TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 

ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 

TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 

GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 

TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 

GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 

ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 

TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 

GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 

TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 

GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 

ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 

AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 

AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 

GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 

GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 

ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 

TG AT ACTTGG AAT AAACTTAT CAC CCCTTTT ATCGTAG AATATTTTAAG ACGTACTT AGT 

CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 

AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 

AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 

CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCGGACTAGTTTTACTAAAATATA 

TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 

TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 

CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 

TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 

TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 

TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 

GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 

TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 

CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 

AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 

TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 

TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 

CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 

AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 

CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 

TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 

CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 

TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 

ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 




AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG1RQHLKR 

VQLRDVSEAEVRQHREARPALLTSRLRPIPKPDGLRPIVNMDYVVGARTFRREKR 

AERLTSRVKALFSVLNYERA 
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GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 

TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCl 1 1 1 lCTACC 

GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAA 

GAGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGA 

AGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACG 

GGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCG 

CAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGC 

GTGCTCAACTACGAGCGGGCGCG 
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MTEHHTPKSRJLRFLENQYVYLCTLNDYVQLVLRGSPASSYSN1CERLRSDVQTSFSIFLHSTVVGF 

DSKPDEGVQFSSPKCSQSELIANVVKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLV 

STFPNYLISILESKNWQLLLE11GSDAMHYLLSKGSIFEALPNDNYLQ1SGIPLFKNNVFEETVSKJCRX 

RTIETSITQNKSARKEVSWNSISISRFSIFYRSSYKKFKQDLYFNLHS1CDRNTVHMWLQWIFPRQFG 

LrNAFQVKQLHKVIPLVSQSTVVPKRLLKVYPLIEQTAKRLHRJSLSKVYNHYCPYIDTHDDEKILS 

YSLKPNQVFAFLRSILVRVFPKLIWGNQRJFEIILKDLETFLKJLSRYESFSLHYLMSNIKISEIEWLVL 

GKJISNAKMCLSDFEKRKQIFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIWKLLCRPFIT 

SMKMEAFEKINEhn^RMDTQKTTLPPAVIRLLPKKNTFRLITNLRKRFLIKMGSNKXMLVSTNQT 

LRPVASILKHLFNEESSGIPFNLEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLMFR 

1VKKKLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 

WTKSSSEIFKMLKEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 

LRVVDDFLFITVNKXDAKKFLNLSLRGFEKJWFSTSLEKTVINFENSNGIINNTFFNESKKRMPFFG 

FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHMSKFNSCCNI 

YRLGYSMCMRAQAYLKRMKDIFIPQRMFITDLLNVIGRKIWKKLAEILGYTSRRFLSSAEVKWLFC 

LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRJAD 
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ggtaccgatttacmcctncncataagctaattgcttcctcgaacgctcctaaatctctggaaatamaacaagaactcaataacaataccaagtcaaattccaatatgaagg 

tgtuttagtgatcgataatamctatmatcggtcgttaccaagtataaggacaaaaagaacaacnccnccccctaaagactmacmanaamactmcaaatatam 

ggncgcnacttttaatcgtggtactgttttagctgctacttctagccaaccgcgtgtttctaccccgtcattggatatagctcnggagtagctcacagaaatccttacaaatctt 

ctgatgagactatanagancattacagtccgtgcatattcnaacatggagccttacactttagatgagtcacgtcgcatgatggagtamggtatcatccaacgmgccttg 

aaaaggttgataanamgcaaaatcatgtccttagtggtggtaatccgcgaaagttttttgatgcttgcacacgtctagcatgangagatancaaaaatttctatccactacaa 

ctccmaacgcggttttattmctatmctattctcatgttgttccaaatatgtatcatctcgtattaggctttmccgtmactcctggaatcgtacc 

aataatctaaanagtncgcnataangatagtagtagaaagattggtgattctactcgtgtaatgttattagtttaaagatacmgcaaaacamattagctatcanatataaaa 

aaaatcctataanataaatanaatcaatatngcggtcactatttatttaaaacgttatgatcagtaggacactttgcatatatatagttatgcttaatggttacttgtaacttgcAT 

GACCGAACACCATACCCCCAAAAGCAGGATTCTTCGCTTTCTAGAGAATCAATATGTATACCTATGTA 

CCTTAAATGATTATGTACAACTTGTTTTGAGAGGGTCGCCGGCAAGCTCGTATAGCAATATATGCGAA 

CGCTTGAGAAGCGATGTACAAACGTCCTTTTCTATTTTTCTTCATTCGACTGTAGTCGGCTTCGACAGT 

AAGCCAGATGAAGGTGTTCAATTTTCTTCTCCAAAATGCTCACAGTCAGAGgtatatatatttttgttttgatttttttctattcg 

ggatagctaatatatgggcagCTAATAGCGAATGTTGTAAAACAGATGTTCGATGAAAGTTTTGAGCGTCGAAGGA 

ATCTACTGATGAAAGGGTTTTCCATGgtaaggtattctaangtgaaatamacctgcaattactgmcaaagagangtamaaccgata 

TCATGAAGATTTTCGAGCCATGCATGTAAACGGAGTACAAAATGATCTCGTTTCTACTTTTCCTAATTA 

CCTTATATCTATACTTGAGTCAAAAAATTGGCAACTTTTGTTAGAAATgtaaataccggttaagatgngcgcactttgaaca 

a K actgacaagtatagTATCGGCAGTGATGCCATGCATTACTTATTATCCAAAGGAAGTATTTTTGAGGCTCTTC 

CAAATGACAATTACCTTCAGATTTCTGGCATACCACTTTTTAAAAATAATGTGTTTGAGGAAACTGTGT 

CAAAAAAAAGAAAGCGAACCATTGAAACATCCATTACTCAAAATAAAAGCGCCCGCAAAGAAGTTTC 

CTGGAATAGCATTTCAATTAGTAGGTTTAGCATTTTTTACAGGTCATCCTATAAGAAGTTTAAGCAAGgt 

aactaatactgnatccncataactaatmagATCTATATTTTAACTTACACTCTATTTGTGATCGGAACACAGTACACATC 

TGGCTTCAATGGATTTTTCCAAGGCAATTTGGACTTATAAACGCATTTCAAGTGAAGCAATTGCACAA 

AGTGATTCCACTGGTATCACAGAGTACAGTTGTGCCCAAACGTCTCCTAAAGGTATACCCTTTAATTGA 

ACAAACAGCAAAGCGACTCCATCGTATTTCTCTATCAAAAGTTTACAACCATTATTGCCCATATATTGA 

CACCCACGATGATGAAAAAATCCTTAGTTATTCCTTAAAGCCGAACCAGGTGTTTGCGTTTCTTCGATC 

CATTCTTGTTCGAGTGTTTCCTAAATTAATCTGGGGTAACCAAAGGATATTTGAGATAATATTAAAAGg 

tangtataaaamanaccactaacga^^ 

ATTATTTAATGAGTAACATAAAGgtaatatgccaaatttttttaccanaanaacaatcagATTTCAGAAATTGAA 1 UG<~ I AO i 
rCTTGGAAAAAGGTCAAATGCGAAAATGTGCTTAAGTGATTTTGAGA AACGCAAG CAAATATTTGCGG 

AATTCATCTACTGGCTATACAATTCGTTTATAATACCTATTTTACAATC I 1 1 I l l l I ATATCACTGAATC 

AAGTGATTTACGAAATCGAACTGTTTATTTTAGAAAAGATATTTGGAAACTCTTGTGCCGACCCTTTAT 

TACATCAATGAAAATGGAAGCGTTTGAAAAAATAAACGAGgtanttaaagtattttttgcaaaaagctaatatmcagAACAA 

TGTTAGGATGGATACTCAGAAAACTACTTTGCCTCCAGCAGTTATTCGTCTATTACCTAAGAAGAATAC 

CTTTCGTCTCATTACGAATTTAAGAAAAAGATTCTTAATAAAGgtanaatttttggtcatcaatgtactnacnctaatctattattag 

caRATGGGTTCAAACAAAAAAATGTTAGTCAGTACGAACCAAACTTTACGACCTGTGGCATCGATACTG 

aLcatttaatcaatgaagaaagtagtggtattccatttaacttggaggtttacatgaagcttcttact 

TTTAAGAAGGATCTTCTTAAGCACCGAATGTTTGGgtaattatataatgcgcgattcctcattanaatmgcagGCGTAAGAAG 

TATTTTGTACGGATAGATATAAAATCCTGTTATGATCGAATAAAGCAAGATTTGATGTTTCGGATTGTT 

AAAAAGAAACTCAAGGATCCCGAATTTGTAATTCGAAAGTATGCAACCATACATGCAACAAGTGACCG 

AGCTACAAAAAACTTTGTTAGTGAGGCGTTTTCCTATTgtaagmatttmcattggaatttmaacaaanctttmagTT^ 

GGTGCCTmGAAAAAGTCGTGCAGTTACTTTCTATGAAAACATCAGATACTTTGTTTGTTGAT^T 

GGATTATTGGACCAAAAGTTCTTCTGAAATTTTTAAAATGCTCAAGGAACATCTCTC 

TACGAAAAAGAAAGGATCAGTGTTGTTACGAGTAGTCGACGATTTCCTCTTTATAACAG^TAAAA 

AGGATGCAAAAAAATTTTTGAATTTATCTTTAAGAGgtgagttgctgtcattcctaagnctaaccgngaagGATTTGAGAA 

ACACAATTTrrCTACGAGCCTGGAGAAAACAGTAATAAACTTTGAAAATAGTAATGGGATAATAAACA 

ATACTrTTTTTAATGAAAGCAAGAAAAGAATGCCATTCTTCGGTTTCTCTGTGAACATGAGGTCTCTTG 

ATACATTGTTAGCATGTCCTAAAATTGATGAAGCCTTATTTAACTCTACATCTGTAGAGCTGACGAAAC 

ATATGGGGAAATC I1 I I I t I 1 ACAAAATTCTAAGgtatactgtgtaactgaataatagctgacaaataatcagATCGAGCCl lOO 

ATCCTTTGCACAAGTATTTATTGACATTACCCACAATTCAAAATTCA^ 

GCTAGGATACTCTATGTGTATGAGAGCACAAGCATACTTAAAAAGGATGAAGGATATATTTATTCCCC 

AAAGAATGTTCATAACGGgtgagtacttattttaactagaaaagtcattaattaaccttagATCTTTTGAATGTTATTGGAAGAAAA 

ATrTG^AAAAAGTTGGCCGAAATATTAGGATATACGAGTAGGCGTTTCTTGTCCT 
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ATGetacetetcegtctcgagacttcagcaatangacacatcagGCTTTTTTGTCTTGGAATGAGAGATGGTTTCAAACCCTCTT 

TCAAATATCATCCATGCTTCGAACAGCTAATATACCAATTTCAGTCATTGACTGATCTTATCAAGCCGC 

TAAGACCAGTTTTGCGACAGGTGTTATTTTTACATAGAAGAATAGCTGATTAAtglcatmcaatttattatatacatcctt 

tanactggtgtcttaaacaatanattacuagtatagctgacccccaaagcaagcatactataggatttctagtaaagtaaaanaatctcgnanagtmgattgacttgtctn 

atccnatactmaagaaagattgacagtggngctgactactgcccacatgcccattaaacgggagtggnaaacanaaaagtaatacatgaggctaatctcctttcatttag 

aataaggaaagtggtmctataatgaataatgcccgcactaatgcaaaaagacgaagattatcttctaaacaaggggganaagcatatccgaaggaaaagagagtaatat 

acccagtgttgngaagaaagcaaggataamggaacaagcttctgcagatgacaggctaaattttggtgaccgaattttggtaaaagccccaggnatccatggtggccg 

gccngctactgagacgaaaagaaactaaggatagtttgaatactaatagctcatttaatgtcnatataaggnngmtncctgacttcaatmgcatgggtgaaaagaaata 

gtgnaagccanangganccgaaatagccaaamcnggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcttcaaaaactcctcctgamaaaggag 

gaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaattttttgcaaaaaagaaaatatcangggagacatctcngatgaatcagatgcgga 

gagtatctccagcggatccngatgtcaataacnctatnctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagitaagtgaccaaaggtacc 
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S- 1 : FFY VTE TTF QKN RLF FYR KS V WSK 
S-2: RQHLKRVQLRDVSEAEVRQHREA 
S-3: ART FRR EKR AER LTS RVK ALF SVL NYE 

A-l: AJCF LHW LMS VYV VEL LRS FFY VTE '.[TF Q . 
A-2: LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS 
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YW 
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Vector Genomic DNA Insert 



Vector 



A5 



5825 bp Sequenced 
-2 kb Hind III Fragment 




1 kb 



RT Motifs 1 2 3(A) 4(B') 5(C) 6(D) 



Introns 



2 J 4 



<;DNA 
5-2C 



CDNA 



n r 



RT-PCR w/ M2-B14 
RT-PCR w/ M2-B15 Be 
RT-PCR w/ M2-B15 Be 
RT-PCR w/ M2-B16 Be 



nc A 
nc B 
nc C 



Hind I 




500 bp 



*0 

m 



m 
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Poly 4 

t t c 

t a a g c c teg 
5'- cag acc aaa gga att cca taa gg -3' 
QTKGIPQG 



4<B') 



P 5 (c') 



jjfj 3'- ctg ctg atg gag gag tag tgg -5' 

H a a aaaaa a a 

|* t t t t 

c c 
Polv 1 
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PCR Product M2 showed Reasonable Match 
with Other Telomerase Proteins 

Ot LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 

Ea_pl23 KGIPQGLCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLTDDYLLIT 

Sp _M2 SILSSFLCHFYMEDLIDEYLSFTKKK GSVLLRW 

Sc_pl03 DGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS QDTLI LKLADDFLI I S 



QKVGIPQG 
j<^aa aaa get ggt acc cct cag gg 

t t c 
\% a a g c c teg 
■|^ag acc aaa gga att cca taa gg > 

H. i 

^ag acc aaa gga acc cca cca ggC TCA ATT 
^tc tgg ttt ccc caa ggt agt ccG AGT TAA 

v 

P KGIPSGSI 



j'!3>AA GAT TTG ATT GAT GAA TAC CTA TCG TTT 
hiCTT CTA AAC TAA CTA CTT ATG GAT AGC AAA 

EDLI DEYLSF 



< Actual Genomic Sequence. 



CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG 
GAC AGT AGA AAA AAC ACA GTA AAG ATA TAC 

LSSFLCHFYM 

ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA 
TGC TTT TTC TTT CCT AGT CAC AAC AAT GCT 

TKKKGSVLLR 



GTA GTC gac 
CAT CAG ctg 

V V D 

< ctg 

a 



gac tac ccc 
ctg atg gag 

D Y L 

ctg atg gag 
a aaa 
t 
c 



etc 


ate 


acc 


gag 


tag 


tgg. 


L 


I 


T 


gag 


tag 


tgg 


a a 


a 


a 


t 


t 


t 


c 







Polv 1 



gac gat ttc ccc ttt ata aca < Actual Genomic Sequence. 

D D F L F I T 
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3' RT PCR Strategy 



mRNA 



IAAAAAAAAAAAAAAAA 



1 . Synthesis o( cDN A with Qt Primer. 
mRNA 



m 



I ifi 

m 



| AAAAAAAAAAAAXAAA 

I yr r i ' l 'T TlTT 1 1 1 1 T 



2. First Round PCR Using Outside Primer and Qq Primer 




TTTTTT T ' lTl " ! T T T 1 



3. Second Round PCR Using Inside Primer and Q| Primer. 



m 



4. Sequence 



Second Round PCR Products Using Inside Primer or Q| Primer. 
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-Size Selected Libraries from P. Nurese 
3~4kb 
5-6kb 
7 - 8kb 
11 - 12 kb 

libraries from J. A. Wise 
Sau 3a Partial Digest 
Hind III Partial Digest 

cDNA Libraries 

GAD (Gal Activation Domain) Library 
REP Library from R. Allshire 
REP81ES Library (old) 
REP81ES Library (new) 
REP41ES Library 



c. 




ip 

VI 



D- 



^Original PCR 

III 3* RT-PCR 

m 




500 bp 



Mg Cone 




% 
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5' RT PGR Strategy 



mRNA 



I AAAAAAAAAAAAAAAA 



ddA- 



: 'fcr 



1 . Synthesis of cDNA with Specific Downstream Primer. 

mRNA 



5* 
3* 



I AAAAAAAAAAAAAAAA 



u 

.ft 



is 
.p 



2. Ugate Oligo with 5'-P and blocked 3* to cDNA using T4 RNA Ligase. 



3. First Round PCR 



4. Second Round PCR 
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Alignment of RT Domains from Telomerase Catalytic Subunits. 

Motif 0 

S.p. Tezlp (429). WLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIW ...(35)... 
S.c. Est2p (366). WLFRQLI PKI IQTFFYCTEI3STVT- Iv'YFRHDTW ...(35)... 
E.a. pl23 (441). WIFEDLWSLIRCFFYVTEQQKSYSKTYYYRKNiW ...(35)... 

* ***** *** 

Motif 1 Motif 2 K 
p hh h K hR h R 
S.p. Tezlp AVIRLLPKK--NTFRLITN-LRKRF ...(61)... 
S.c. Est2p SKMRI I PKKSNNEFRIIAIPCRGAD ...(62)... 
E.a. pl23 GK.LRLIPKK--TTFRPIMTFNKKIV ...(61)... 



Motif 3(A) AF 

h hDh GY h 

? S.p. Tezlp KKYFVRIDIKSCYDRIKQDLMFRIVK ...(89)... 
M S.C. Est2p EL YFMKFDVKSCYDSI PRMECMRI LK ...(75)... 
in E.a. pl23 KLFFATMDIEKCYDSVNREKLSTFLK ...(107)... 
,; ***** * 

Motif 4 (B' ) 

hPQG pP hh h 
S.p. Tezlp YLQKVG I PQG S I LS SFLCHF YMEDL I DE Y LS F ...(6). 
S.c. Est2p Y IREDGLFQGSSLSAPIVDLVYDDLLEFYSEF ...(8). 
f. E.a. pl23 YKQTKG I PQGLCV S S I L S SF YYATLEES S LGF ...(14) 



Y Motif 5(C) Motif 6(D) 

3 h F DDhhh Gli h cK h 

* S.p. Tezlp VLLRWDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENS .(205) 
S.c. Est2p LILKLADDFLIISTDQQQVINIKKLAMGGFQKYI^AKAI^IRDKILAVSSQS .(173) 
E.a. pl23 LLMP.LTDbYLLITTQENNAVLFIEKLINVSRENGF'KFMMKKLQTSFPLS .(209) 



/ % 
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FIGURE 43 

Disruption strategy for the putative telomerase genes. 



wild type telomerase gene 



wild type telomerase gene 



1 . Transform with linear fragment 
containing the telomerase gene 
disrupted with a LEU2 or ura4 marker. 
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2. Assay in selective media. 

3. Sporulate, and grow on 
selective media. 
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(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 
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FIGURE 44 
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Tezl disruption causes progressive 
shortening of telomeres in S. pombe 
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FIGURE 46 



1 oataccaatttactttcctttcttcataagctaattgcttcctcgaacgctcctaaatctctggaaatatttttacaaga 80 
81 actcaataacaataccaagtcaaattccaatatgaaggtgttattagtgatcgataatatttctattttatcggtcgtta 160 

1 ccaagtrtaagga^^ \\° Q 
241 ggttcgcttacttttaatcgtggtactgttttagctgctacttctagccaaccgcgtgtttctaccccgtcattggatat 320 

321 agctc^tggagtagctcacagaaatccttacaaatcttctgatgagactatattagattcattacagtccgtgcatattc 400 

01 ttaacatggagccttacactttagatgagtcacgtcgcatgatggagtatttggtatcatccaacgtttgccttgaaaag 480 

481 gttgataattatttgcaaaatcatgtccttagtggtggtaatccgcgaaagttttttgatgcttgcacacgtctagcatg 560 

561 attgagatattcaaaaatttctatccactacaactcctttaacgcggttttatttttctattttctattctcatgttgtt 640 

641 ccaLLtgtatcatctcgtattaggcttttttccgttttactcctggaatcgtacctttttcactattccccctaatga 720 

721 ataatctaaattagtttcgcttataattgatagtagtagaaagattggtgattctactcgtgtaatgttattagtttaaa 800 

801 gatactttgcaaaacatttattagctatcattatataaaaaaaatcctataattataaatattaatcaatatttgcggtc 880 

881 Ltatttatttaaaacgttatgatcagtaggacactttgcatatatatagttatgcttaatggttacttgtaacttgc 958 

959 ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG AAT CAA TAT GTA 1018 
1M TEHHTPKSRILRFLENQYV 20 

M19 TAG CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT TTG AGA GGG TCG CCG GCA AGC TCG 1078 
y LCTLNDYVQ LVLRGS PASS 40 

$79 TAT AGC AAT ATA TGC GAA CGC TTG AGA AGC GAT GTA CAA ACG TCC TTT TCT ATT TTT CTT 1138 
4l Y S N I C E R L R S D V Q T S F S I F L 60 

iiU 

$39 CAT TCG ACT GTA GTC GGC TTC GAC ACT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 
H] 61 HSTVV GFDSKPDEGVQFSSP 80 
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FIGURE 46 (cont.) 



1199 AAA TGC TCA CAG TCA GAG gtatatata tt tttgt tttgatttt tt tctattcgggatagctaata tatgggcag 1272 
81 K C S 0 S E 86 

127 3 CTA ATA GCG AAT GTT GTA AAA CAG ATG TTC GAT GAA AGT TTT GAG CGT CGA AGG AAT CTA 1332 
87 L I ANVVKQM F DES FERRRN L 106 

1333 CTG ATG AAA GGG TTT TCC ATG gtaaggtat tctaattgtgaaatat t tacc tgcaattactgt t tcaaagaga 1405 
107 L M K G F S M 113 

1406 ttgtatttaaccgataaag AAT CAT GAA GAT TTT CGA GCC ATG CAT GTA AAC GGA GTA CAA AAT 1469 
114 NHEDF RAM H.VNGVQN 128 

147 0 GAT CTC GTT TCT ACT TTT CCT AAT TAC CTT ATA TCT ATA CTT GAG TCA AAA AAT TGG CAA 1529 
129 DLVSTFPNYLISI LESKNWQ 148 

1530 CTT TTG TTA GAA AT gtaaataccggttaagatgttgcgcactttgaacaagactgacaagtatag T ATC GGC 1601 
149 L L L E I I G 155 



16Q2 AGT GAT GCC ATG CAT TAC TTA TTA TCC AAA GGA AGT ATT TTT GAG GCT CTT CCA AAT GAC 1661 

S§6 SDAMHYLLSKGSI FEALPND 175 

1&62 AAT TAC CTT CAG ATT TCT GGC ATA CCA CTT TTT AAA AAT AAT GTG TTT GAG GAA ACT GTG 17 21 

!l6 NYLQISGI PLFKNNVFEETV 195 

m 

1$22 TCA AAA AAA AGA AAG CGA ACC ATT GAA AC A TCC ATT ACT CAA AAT AAA AGC GCC CGC AAA 17 81 

^6 SKKRKRTI ETS IT QNKSARK 215 

1702 GAA GTT TCC TGG AAT AGC ATT TCA ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC TAT 1841 

M6 EVSWNSISISRFSIFYRSSY 235 

1^42 AAG AAG TTT AAG CAA G gtaac taatactgttatccttcataac taatt t tag AT CTA TAT TTT AAC 
iWhl 

©6 K K F K Q D L Y F N 245 

1908 TTA CAC TCT ATT TGT GAT CGG AAC ACA GTA CAC ATG TGG CTT CAA TGG ATT TTT CCA AGG 1967 

246 LHSICDRNTVHMWLQWIF PR 265 

1968 CAA TTT GGA CTT ATA AAC GCA TTT CAA GTG AAG CAA TTG CAC AAA GTG ATT CCA CTG GTA 2027 

266 QFG LIN AFQVKQLHKVI PLV 285 



2028 TCA CAG AGT ACA GTT GTG CCC AAA CGT CTC 
286 SQSTVVPKRL 

2088 GCA AAG CGA CTC CAT CGT ATT TCT CTA TCA 
306 AKRLHRISLS 

214 8 GAC ACC CAC GAT GAT GAA AAA ATC CTT AGT 
326 DTHDDEKI LS 

2208 TTT CTT CGA TCC ATT CTT GTT CGA GTG TTT 
346 FLRSILVRVF 



CTA AAG GTA TAC CCT TTA ATT GAA CAA ACA 2087 

LKVY PLIEQT 305 

AAA GTT TAC AAC CAT TAT TGC CCA TAT ATT 2147 

KVYNHYCPYI 325 

TAT TCC TTA AAG CCG AAC CAG GTG TTT GCG 2207 

YSLK PNQVFA 345 

CCT AAA TTA ATC TGG GGT AAC CAA AGG ATA 2267 

PKLIWGNQRI 365 



2268 TTT GAG ATA ATA TTA AAA G gtat tgtataaaatttattaccac taacgatt t taccag AC CTC GAA ACT 2336 
366 FEIILKD LET 375 



FIGURE 46 (cont.) 



TTC TTG AAA TTA TCG AGA TAC GAG TCT TTT AGT TTA CAT TAT TTA ATG AGT AAC ATA AAG 2396 

376 V L K L S R Y E S F S L H Y L M S N I K 395 

2397 gtaatatgccaaatttttttaccattaattaacaatcag ATT TCA GAA ATT GAA TGG CTA GTC CTT GGA 2465 
396 

9466 AAA AGG TCA AAT GCG AAA ATG TGC TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT GCG 2525 

406 K R S N A K M C L S D F E K R K Q I F A 425 

2526 GAA TTC ATC TAC TGG CTA TAC AAT TCG TTT ATA ATA CCT ATT TTA CAA TCT TTT TTT TAT 2585 

426 EFIYWLYNSFII PI LQSFFY 445 

2586 ATC ACT GAA TCA AGT GAT TTA CGA AAT CGA ACT GTT TAT TTT AGA AAA GAT ATT TGG AAA 2645 

446 ITESSDLRNRTVYFRKDIWK 465 

2646 CTC TTG TGC CGA CCC TTT ATT ACA TCA ATG AAA ATG GAA GCG TTT GAA AAA -ATA AAC GAG 2705 

4 66 L LC R P F I T S M KM E A F E K I N E 485 

...2706 gtattttaaagtattttttgcaaaaagctaatattttcag AAC AAT GTT AGG ATG GAT ACT CAG AAA ACT 2775 
U ._, NNVRMDTQKT 

IB 2776 ACT TTG CCT CCA GCA GTT ATT CGT CTA TTA CCT AAG AAG AAT ACC TTT CGT CTC ATT ACG 2835 

=P 496 T L P P A V I R L L P K K N T F R L I T 515 

l«J 

17*2836 AAT TTA AGA AAA AGA TTC TTA ATA AAG gtattaatttttggtcatcaatgtactttacttctaatctatta 2906 

i - r» r T V 524 

^ 516 N L R K R F L I K 

in 

- 2907 ttagcag ATG GGT TCA AAC AAA AAA ATG TTA GTC AGT ACG AAC CAA ACT TTA CGA CCT GTG 2967 

Q 525 MGSNKKMLVSTNQTLRPV 542 

its 

IP2968 GCA TCG ATA CTG AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA TTT AAC TTG GAG 3027 

|lS 543 A S I L K H L I N E E S S G I P F N L E 562 

S 3028 GTT TAC ATG AAG CTT CTT ACT TTT AAG AAG GAT CTT CTT AAG CAC CGA ATG TTT GG gtaat 3088 

563 VYMKLL .TFKKDLLKHRMFG 581 

3089 tatataatgcgcgattcctcattattaattttgcag G CGT AAG AAG TAT TTT GTA CGG ATA GAT ATA 3155 

582 R K K Y F V R I D I 591 

3156 AAA TCC TGT TAT GAT CGA ATA AAG CAA GAT TTG ATG TTT CGG ATT GTT AAA AAG AAA CTC 3215 

592 KSC YDR I K QD LM FR I V K K K L 611 

3216 AAG GAT CCC GAA TTT GTA ATT CGA AAG TAT GCA ACC ATA CAT GCA ACA AGT GAC CGA GCT 3275 
612 K D P E F V I R K Y A T I H A T S D R A 631 

3276 ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC TAT T gtaagtttattttttcattggaattttttaacaa 3343 
632 TKNFVSEAFSYF 

3344 attcttttttag TT GAT ATG GTG CCT TTT GAA AAA GTC GTG CAG TTA CTT TCT ATG AAA ACA 3405 

DM VPFEKVVQLLSMKT6b9 



644 



3406 TCA GAT ACT TTG TTT GTT GAT TTT GTG GAT TAT TGG ACC AAA AGT TCT TCT GAA ATT TTT 3465 

VDFVDYWTKSSSE IF b'» 



660 S D T h F 

3466 AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT AAG gtataccaattgttgaattgtaataaca 3532 
680 KMLKEHLSGHIVK 
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FIGURE 46 (cont.) 

3533 ctaatgaaactag ATA GGA AAT TCT CAA TAC CTT CAA AAA GTT GGT ATC CCT CAG GGC TCA 3593 

693 I G N S Q Y L Q K V G I P Q G S 708 

,, q , ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTG ATT GAT GAA TAC CTA TCG 3653 

109 J L S S F L C H F Y M E D L I D E Y L S 728 

3654 TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA GTA GTC GAC GAT TTC CTC TTT ATA ACA 3713 

729 FTKKKGSVLLRVVDDFLFIT 748 

3714 GTT AAT AAA AAG GAT GCA AAA AAA TTT TTG AAT TTA TCT TTA AGA G gtgagttgctgtcattcc 3777 

749 VNKKDAKKFLNLSL RG 764 

3778 taagttctaaccgttgaag GA TTT GAG AAA CAC AAT TTT TCT ACG AGC CTG GAG AAA ACA GTA 3840 

7 « FEKHNFSTSLEKTV 778 



3841 
779 


ATA 
I 


AAC 
N 


TTT 
F 


GAA 
E 


AAT 
N 


AGT 
S 


AAT 
N 


GGG 
G 


ATA 
I 


ATA 
I 


AAC 
N 


AAT 
N 


ACT 
T 


TTT 
F 


TTT 
F 


AAT 
N 


GAA 
E 


AGC AAG AAA 
S K K 


3900 
798 


3901 

-, if? - / 


AGA 
R 


ATG 
M 


CCA 
P 


TTC 
F 


TTC 
F 


GGT 
G 


TTC 
F 


TCT 

S 


GTG 
V 


AAC 
N 


ATG 
M 


AGG 
R 


TCT 
S 


CTT 
L 


GAT 
D 


ACA 

T 


TTG 
L 


TTA GCA TGT 
LAC 


3960 
818 


IP 61 
,£819 


CCT 
P 


AAA 
K 


ATT 
I 


GAT 
D 


GAA 
E 


GCC 
A 


TTA 
L 


TTT 
F 


AAC 
N 


TCT 
S 


ACA 
T 


TCT 

S 


GTA 
V 


GAG 
E 


CTG 
L 


ACG 
T 


AAA 
K 


CAT ATG GGG 
H M G 


4020 
838 


j«021 
%j839 


AAA 
K 


TCT 

S 


TTT 
F 


TTT 
F 


TAC 
Y 


AAA 
K 


ATT 
I 


CTA 
L 


AG gtatactgtgtaactgaataatagctgacaaataatcag A TCG 
R S 


4089 
848 


m 

.J090 
i.^849 


AGC 
S 


CTT 
L 


GCA 
A 


TCC 
S 


TTT 
F 


GCA 
A 


CAA 
Q 


GTA 
V 


TTT 
F 


ATT 
I 


GAC 
D 


ATT 
I 


ACC 
T 


CAC 
H 


AAT 
N 


TCA 

S 


AAA 
K 


TTC AAT TCT 
F N S 


4149 
868 


jpiso 
ii5 869 


TGC 

C 


TGC 
C 


AAT 
N 


ATA 
I 


TAT 
Y 


AGG 
R 


CTA 
L 


GGA 
G 


TAC 
Y 


TCT 
S 


ATG 
M 


TGT 
C 


ATG 
M 


AGA 
R 


GCA 
A 


CAA 
Q 


GCA 
A 


TAC TTA AAA 
Y L K 


4209 
888 


P4210 
! * 889 


AGG 
Ft 


ATG 
M 


AAG 
K 


GAT 
D 


ATA 
I 


TTT 
F 


ATT 
I 


CCC 
P 


CAA 
Q 


AGA 
R 


ATG 
M 


TTC 
F 


ATA ACG G gtgagtacttattttaactaga 
I T D 


4274 
903 


4275 
904 


aaagtcattaattaaccttag AT CTT TTG AAT GTT ATT GGA AGA AAA ATT TGG AAA AAG TTG GCC 

LLNVIGRKIWKKLA 


4339 
917 


4340 
918 


GAA 
E 


ATA 
I 


TTA J3GA 
L G 


TAT 
Y 


ACG 
T 


AGT 

S 


AGG 
R 


CGT 
R 


TTC 
F 


TTG 
L 


TCC 
S 


TCT 

S 


GCA 
A 


GAA 
E 


GTC 
V 


AAA 
K 


TG gtacgtgtc 
w 


4401 
935 


4402 ggtctcgagacttcagcaatattgacacatcag G 
936 


CTT 
L 


TTT 
F 


TGT 
C 


CTT 
L 


GGA 
G 


ATG 
M 


AGA 
R 


GAT 
D 


GGT TTG AAA 
G L K 


4468 
946 


4469 
947 


CCC 
P 


TCT 

S 


TTC 
F 


AAA 
K 


TAT 
Y 


CAT 
H 


CCA 
P 


TGC 
C 


TTC 
F 


GAA 
E 


CAG 
Q 


CTA 
L 


ATA 
I 


TAC 
Y 


CAA 
Q 


TTT 
F 


CAG 
Q 


TCA TTG ACT 
S L T 


4528 
966 


4529 
967 


GAT 
D 


CTT 
L 


ATC 
I 


AAG 
K 


CCG 
P 


CTA 
L 


AGA 
R 


CCA 
P 


GTT 
V 


TTG 
L 


CGA 
R 


CAG 
Q 


GTG 
V 


TTA 
L 


TTT 
F 


TTA 
L 


CAT 
H 


AGA AGA ATA 
R R I 


4588 
986 



4589 GCT GAT TAA tgtcattttcaatttattatatacatcctttattactggtgtcttaaacaatattattactaagtata 4665 



987 A D 



FIGURE 46 (cont.) 



4666 gctgacccccaaagcaagcatactataggatttctagtaaagtaaaattaatctcgttattagtcttgattgacttgtct 4745 
4746 ttatccttatacttttaagaaagattgacagtggttgctgactactgcccacatgcccatcaaacgggagtggttaaaca 4825 
4826 ttaaaagtaatacatgaggctaatctcctttcatttagaataaggaaagtggttttctataatgaataatgcccgcacta 4905 

4906 atgcaaaaagacgaagattatcttctaaacaagggggattaagcatatccgaaggaaaagagagtaatatacccagtgtt 4 985 
4986 gttgaagaaagcaaggataatttggaacaagcttctgcagatgacaggctaaattttggtgaccgaactttggtaaaagc 5065 
5066 cccaggttatccatggtggccggccttgctactgagacgaaaagaaactaaggatagtttgaataccaatagctcattta 514 5 
514 6 atgtcttatataaggttttgttttttcctgacttcaattttgcatgggtgaaaagaaatagtgttaagccattattggat 5225 
5226 tccgaaatagccaaatttcttggttcctcaaagcggaagtctaaagaacttattgaagcttatgaggcttcaaaaactcc 5305 
5306 tcctgatttaaaggaggaatcttccaccgatgaggaaatggatagcttatcagctgctgaggagaagcctaattttttgc 5385 
5386 aaaaaagaaaatatcattgggagacatctcttgatgaatcagatgcggagagtatctccagcggatccttgatgtcaata 54 65 
5466 acttctatttctgaaatgtatggtcctactgtcgcttcgacttctcgtagctctacgcagttaagtgaccaaaggtacc 5544 
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i 

met ser val tyr val val glu leu leu 

GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 

10 20 

arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 

AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 



leu phe phe tyr arg 
CTC TTT TTC TAC CGG 

40 

gly ile arg gin his 
GGA ATC AGA CAG CAC 



30 

lys ser val trp ser 
AAG AGT GTC TGG AGC 



leu lys arg val gin 
TTG AAG AGG GTG CAG 



lys leu gin ser ile 
AAG TTG CAA AGC ATT 

50 

leu arg glu leu ser 
CTG CGG GAG CTG TCG 



glu ala glu 
GAA GCA GAG 

70 

thr ser arg 
ACG TCC AGA 



ile val asn 
ATT GTG AAC 

100 

glu lys 
GAA AAG ARG 



ser val leu 
AGC GTG CTC 

130 

ala ser val 
GCC TCT GTG 



60 

val arg gin 
GTC AGG CAG 



leu arg phe 
CTC CGC TTC 

90 

met asp tyr 
ATG GAC TAC 



ala glu arg 
GCC GAG CGT 

120 

asn tyr glu 
AAC TAC GAG 



leu gly leu 
CTG GGC CTG 



his arg glu 
CAT CGG GAA 



ile pro lys 
ATC CCC AAG 



val val gly 
GTC GTG GGA 



leu thr ser 
CTC ACC TCG 



arg ala arg 
CGG GCG CGG 



asp asp ile 
GAC GAT ATC 



ala arg pro 
GCC AGG CCC 

80 

pro asp gly 
CCT GAC GGG 



ala arg thr 
GCC AGA ACG 

110 

arg val lys 
AGG GTG AAG 



arg pro gly 
CGC CCC GGC 

140 
his arg ala 
CAC AGG GCC 



ala leu leu 
GCC CTG CTG 



leu arg pro 
CTG CGG CCG 



phe arg arg 
TTC CGC AGA 



ala leu phe 
GCA CTG TTC 



leu leu gly 
CTC CTG GGC 



trp arg thr 
TGG CGC ACC 



phe val leu arg val 
TTC GTG CTG CGT GTG 

160 

phe val lys val asp 
TTT GTC AAG GTG GAT 



asp arg leu thr glu 
GAC AGG CTC ACG GAG 



150 

arg ala gin asp pro 
CGG GCC CAG GAC CCG 



val thr gly ala tyr 
GTG ACG GGC GCG TAC 

180 

val ile ala ser ile 
GTC ATC GCC AGC ATC 



pro pro glu leu tyr 
CCG CCT GAG CTG TAC 

170 

asp thr ile pro gin 
GAC ACC ATC CCC CAG 



ile lys pro gin asn 
ATC AAA CCC CAG AAC 
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190 

thr tyr cys val arg 
ACG TAC TGC GTG CGT 



gly thr ser ala arg 
GGC ACG TCC GCA AGG 

220 

gin gly ile pro gin 
CAG GGG ATC CCG CAG 



leu cys tyr gly asp 
CTG TGC TAC GGC GAC 

250 

arg asp gly leu leu 
CGG GAC GGG CTG CTC 



thr pro his leu thr 
ACA CCT CAC CTC ACC 

280 

arg gly val pro glu 
CGA GGT GTC CCT GAG 



val val asn phe pro 
GTG GTG AAC TTC CCT 

310 

phe val gin met pro 
TTT GTT CAG ATG CCG 



leu leu asp thr arg 
CTG CTG GAT ACC CGG 

340 

tyr ala arg thr ser 
TAT GCC CGG ACC TCC 



phe lys ala gly arg 
TTC AAG GCT GGG AGG 

370 

arg leu lys cys his 
CGG CTG AAG TGT CAC 



arg tyr ala val val 
CGG TAT GCC GTG GTC 

210 

pro ser arg ala thr 
CCT TCA AGA GCC ACG 



gly ser ile leu ser 
GGC TCC ATC CTC TCC 

240 

met glu asn lys leu 
ATG GAG AAC AAG CTG 



leu arg leu val asp 
CTG CGT TTG GTG GAT 

270 

his ala lys thr phe 
CAC GCG AAA ACC TTC 



tyr gly cys val val 
TAT GGC TGC GTG GTG 

300 

val glu asp glu ala 
GTA GAA GAC GAG GCC 



ala his gly leu phe 
GCC CAC GGC CTA TTC 

330 

thr leu glu val gin 
ACC CTG GAG GTG CAG 



ile arg ala ser leu 
ATC AGA GCC AGT CTC 

360 

asn met arg arg lys 
AAC ATG CGT CGC AAA 



ser leu phe leu asp 
AGC CTG TTT CTG GAT 



200 

gin lys ala ala met 
CAG AAG GCC GCC ATG 



ser tyr val gin cys 
TCC TAC GTC CAG TGC 

230 

thr leu leu cys ser 
ACG CTG CTC TGC AGC 



phe ala gly ile arg 
TTT GCG GGG ATT" CGG 

260 

asp phe leu leu val 
GAT TTC TTG TTG GTG 



leu arg thr leu val 
CTC AGG ACC CTG GTC 

290 

asn leu arg lys thr 
AAC TTG CGG AAG ACA 



leu gly gly thr ala 
CTG GGT GGC ACG GCT 

320 

pro trp cys gly leu 
CCC TGG TGC GGC CTG 



ser asp tyr ser ser 
AGC GAC TAC TCC AGC 

350 

thr phe asn arg gly 
ACC TTC AAC CGC GGC 



leu phe gly val leu 
CTC TTT GGG GTC TTG 

380 

leu gin val asn ser 
TTG CAG GTG AAC AGC 
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390 

leu gin thr val cys thr asn ile tyr lys ile leu leu leu gin 
CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG 



400 

ala tyr arg phe his 
GCG TAC AGG TTT CAC 



gin val trp lys asn 
CAA GTT TGG AAG AAC 

430 

arg leu pro leu leu 
CGG CTC CCT CTG CTA 



ala cys val leu gin 
GCA TGT GTG CTG CAG 

420 

pro his phe ser cys 
CCA CAT TTT TCC TGC 



leu his pro glu ser 
CTC CAT CCT GAA AGC 



410 

leu pro phe his gin 
CTC CCA TTT CAT CAG 



ala ser ser leu thr 

GCG TCA TCT CTG ACA 

440 

gin glu arg arg asp 

CAA GAA* CGC AGG .GAT 



val ala gly gly gin 
GTC GCT GGG GGC CAA 

460 

arg ala val ala val 
CGT GCA GTG GCT GTG 



450 

gly arg arg arg pro 
GGG CGC CGC CGG CCC 



pro pro ser ile pro 
CCA CCA AGC ATT CCT 



ser ala leu arg gly 
TCT GCC CTC CGA GGC 

470 

ala gin ala asp ser 
GCT CAA GCT GAC TCG 



480 

thr pro cys his leu arg ala thr pro gly val thr gin asp ser 
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 



490 500 

pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 



510 

pro gly gly arg ser gin pro gly thr ala leu arg leu gin asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gin pro gly arg glu gin 
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 



thr pro ala ala leu 
ACA CCA GCA GCC CTG 

550 

arg gly gly pro his 
AGG GGC GGC CCA CAC 



540 

ser arg arg ala tyr 
TCA CGC CGG GCT TAT 



pro gly leu his arg 
CCA GGC CTG CAC CGC 



thr ser gin gly gly 
ACG TCC CAG GGA GGG 

560 

trp glu ser glu ala 
TGG GAG TCT GAG GCC 



564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 
CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 
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AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 
CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 
TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Motif -1 

Ep pi 23 ...LVVSLIRCFFYVTEQQKSYSKT... 

Sp Tezl ...FIIPILQSFFYITESSDLRNRT... 

Sc Est2 ...LIPKIIQTFFYCTEISSTVTIV... 

HsTCPl ...YVVELLRSFFYVTETTFQKNRL... 
consensus FFY TE 

K 

Motif 0 phhh K hRh R 

Ep pi 23 . . . KSLGFAPGKLRLI PKKT— TFRPIMTFNKKIV. . . 

Sp Tezl . . .QKTTLPP A VIRLLPKKN--TFRLITNLRKRFL. . . 

Sc Est2 ...TLSNFNHSKMRIIPKKSNNEFRIIAIPCRGAD.. 

Hs TCP1 ...ARPALLTSRLRFIPKPD--GLRPIVNMDYVVG. 
consensus R PK R I 

AF 

Motif A h hDh GY h 

Ep pl23 ...PKLFFATMDIEKCYDSVNREKLSTFLK... 

Sp Tezl ...RKKYFVRIDIKSCYDRIK.QDLMFRJVK... 

Sc Est2 ...PELYFMKFDVKSCYDSIPRMECMRILK... 

Hs TCP1 ... PEL YFVKVDVTGAYDTIPQDRLTEVIA.. .//... 
consensus F D YD 

Motif B hPQG pS hh 

Ep pl23 ...NGKFYKQTKGIPQGLCVSSILSSFYYA... 

Sp Tezl ...GNSQYLQKVGIPQGSILSSFLCHFYME... 

Sc Est2 ...EDKCYIREDGLFQGSSLSAPIVDLVYD... 

Hs TCP1 ...RATSYVQCQGIPQGSILSTLLCSLCYG... 
consensus G QG S 

Y 

Motif C h F DDhhh 

Eppl23 ...PNVNLLMRLTDDYLLITTQENN... 

SpTezl ...KKGSVLLRVVDDFLFITVNKKD... 

Sc Est2 ...SQDTLILKLADDFLIISTDQQQ... 

HsTCPl ...RRDGLLLRLVDDFLLVTPHLTH... 
consensus DD L 

Motif D Ghh cK 

Eppl23 ...NVSRENGFKFNMKKL... 

SpTezl ...LNLSLRGFEKHNFST... 

Sc Est2 ...KKLAMGGFQKYNAKA... 

Hs TCP 1 ... LRTLVRG VPE YGC V V . . . 
consensus G 
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1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 
5 1 CCGCGATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 
1 0 1 AGCC ACTACC GCG AGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 
1 5 1 GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 
201 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 
25 1 CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 
301 CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT 
3 5 1 TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CG AGGCCTTC 
40 1 ACC ACCAGCG TGCGCAGCTA CCTGCCC AAC ACGGTGACCG ACGCACTGCG 
451 GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC 
501 TGGTTCACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 
551 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 
601 TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC 
65 1 C AACGGGCCT GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 
701 CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 
75 1 GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 
80 1 CCGTTGGGCA GGGGTCCTGG GCCC ACCCGG GC AGGACGCC TGGACCGAGT 
85 1 GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGCCAC 
90 1 CTCTTTGG AG GGTGCGCTCT CTGGCACGCG CCACTCCCAC CCATCCGTGG 
95 1 GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT 
1 00 1 GGGACACGCC TTGTCCCCCG GTGTACGCCG AGACCAAGC A CTTCCTCTAC 
1 05 1 TCCTCAGGCG AC A AGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG 
1101 AGGCCCAGCC TGACTGGCGT TCGGGAGGTT CGTGGAGAC A NTCTTTCTGG 
1151 TTCCAGGCCT TGGATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCCA 
1 201 GCGNTACTGG CAAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG 
1 25 1 CGCAGTGCCC CTACGGGGTG TTCCTCAAGA CGCACTGCCC GCTGCGAGCT 
1 301 GCGGTCACCC CAGCAGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 
1351 TGTGGCGGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG 
1401 CTGCTCCGCC AGC ACAGCAG CCCCTGGC AG GTGTACGGCT TCGTGCGGGC 
1 45 1 CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG 
1501 AACGCCGCTT CCTC AGG AAC ACCAAGAAGT TC ATCTCCCT GGGGAAGCAT 
1 55 1 GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG 
1 601 CGCTTGGCTG CGC AGG AGCC CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC 
1 65 1 ACCGTCTGCG TGAGGAGATC CTGGCC AAGT TCCTGCACTG GCTGATGAGT 
1 701 GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGGAGACCAC 
1 75 1 GTTTCAAAAG AAC AGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 
1 801 TGCAAAGCAT TGGAATCAGA CAGCACTTGA AGAGGGTGCA GCTGCGGGAG 
1851 CTGTCGGAAG CAGAGGTCAG GCAGCATCGG GAAGCCAGGC CCGCCCTGCT 
1 90 1 G ACGTCC AGA CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCG ATTG 
1 95 1 TGAACATGGA CTACGTCGTG GGAGCCAGAA CGTTCCGCAG AGAAAAGAGG 
2001 GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG TGCTCAACTA 
205 1 CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 
2101 ACG ATATCCA C AGGGCCTGG CGC ACCTTCG TGCTGCGTGT GCGGGCCCAG 
2151 GACCCGCCGC CTG AGCTGTA CTTTGTCAAG GTGGATGTGA CGGGCGCGTA 
220 1 CGACACCATC CCCC AGGACA GGCTCACGGA GGTCATCGCC AGCATCATCA 
225 1 AACCCCAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT CC AG AAGGCC 
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230 1 GCCCATGGGC ACGTCCGCAA GGCCTTC AAG AGCCACGTCT CTACCTTGAC 
23 5 1 AG ACCTCCAG CCGTAC ATGC GAC AGTTCGT GGCTC ACCTG C AGG ANAAC A 
2401 GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG 
245 1 GCCAGCAGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACCACGC 
2501 CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 
25 5 1 GCTCC ATCCT CTCC ACGCTG CTCTGC AGCC TGTGCTACGG CG AC ATGG AG 
260 1 AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 
265 1 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 
270 1 TC AGG ACCCT GGTCCG AGGT GTCCCTG AGT ATGGCTGCGT GGTG AACTTG 
275 1 CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 
280 1 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 
285 1 TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC 
290 1 CGGACCTCC A TCAGAGCCAG TCTC ACCTTC AACCGCGGCT TCAAGGCTGG 
295 1 GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 
300 1 GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGC ACCAAC 
305 1 ATCTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT 
3101 GCAGCTCCC A TTTCATCAGC AAGTTTGGAA GAACCCC AC A TTTTTCCTGC 
3151 GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT GAAAGCCAAG 
320 1 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 
325 1 CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 
3301 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCC AG 
3351 ACGCAGCTG A GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGG AGGC 
340 1 CGCAGCCAAC CCGGCACTGC CCTCAGACTT CAAGACCATC CTGG ACTGAT 
34 5 1 GGCCACCCGC CCACAGCCAG GCCGAG AGC A GAC ACC AGC A GCCCTGTC AC 
3501 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 
3551 ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 
360 1 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 
365 1 GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 
370 1 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACCAGGAGC CCGGCTTCCA 
375 1 CTCCCCACAT AGGAATAGTC CATCCCCAGA TTCGCCATTG TTCACCCCTC 
3801 GCCCTGCCCT CCTTTGCCTT CCACCCCCAC CATCCAGGTG GAGACCCTGA 
385 1 GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG 
3901 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTC AAATT 
395 1 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT 
4001 TTGAAAAAAA AAAAAAAAAA AAAAAAAAA 
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FIGURE 53 



met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arc ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg clu val leu pro leu ala thr phe val arg arg leu 
CAC TAG CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

aly pro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGC- CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 



50 



60 



phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg oro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 
i eu ivs glu leu val ala arg val leu gin arg leu cys glu arg 
CTG AAG GAG CTG- GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 



I 



cyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 
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trp gly leu 
TGG GGG CTG 



leu leu ala 
CTG CTG GCA 



ala tyr gin 
GCC TAC CAG 



thr gin ala 
ACT CAG GCC 



leu gly cys 
CTG GGA TGC 



val pro leu 
GTC CCC CTG 



ser ala ser 
AGT GCC AGC 



ala ala pro 
GCT GCC CCT 



ala his pro 
GCC CAC CCG 



val val ser 
GTG GTG TCA 



140 

leu leu arg 
CTG CTG CGC 



arg cys ala 
CGC TGC GCG 

170 

val cys gly 
GTG TGC GGG 



arg pro pro 
CGG CCC CCG 

200 

glu arg ala 
GAA CGG GCC 



gly leu pro 
GGC CTG CCA 

230 

arg ser leu 
CGA AGT CTG 



glu pro glu 
GAG CCG GAG 

260 

gly arg thr 
GGC AGG ACG 



pro ala arg 
CCT GCC AGA 



arg val gly 
CGC GTG GGC 

160 

leu phe val 
CTC TTT GTG 



pro pro leu 
CCG CCG CTG 

190 

pro his ala 
CCA CAC GCT 



trp asn his 
TGG AAC CAT 

220 

ala pro gly 
GCC CCG GGT 

pro leu pro 
CCG TTG CCC 

250 

arg thr pro 
CGG ACG CCC 



arg gly pro 
CGT GGA CCG 

280 

pro ala glu 
CCC GCC GAA 



asp asp val 
GAC GAC GTG 



leu val ala 
CTG GTG GCT 



tyr gin leu 
TAC CAG CTC 



ser gly pro 
AGT GGA CCC 



ser val arg 
AGC GTC AGG 



ala arg arg 
GCG AGG AGG 



lys arg pro 
AAG AGG CCC 



val gly gin 
GTT GGG CAG 



ser asp arg 
AGT GAC CGT 



glu ala thr 
GAA GCC ACC 



150 
leu val his 
CTG GTT CAC 



pro ser cys 
CCC AGC TGC 

180 

gly ala ala 
GGC GCT GCC 



arg arg arg 
CGA AGG CGT 

210 
glu ala gly 
GAG GCC GGG 



arg gly gly 
CGC GGG GGC 

240 

arg arg gly 
AGG CGT GGC 

gly ser trp 
GGG TCC TGG 

270 
gly phe cys 
GGT TTC TGT 



ser leu glu 
TCT TTG GAG 
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290 

giy ala leu ser gly 
GGT GCG CTC TCT GGC 



gin his his ala gly 
CAG CAC CAC GCG GGC 

320 

trp asp chr pro cys 
TGG GAG ACG CCT TGT 



leu tyr ser ser gly 
CTC TAG TCC TCA GGC 

350 

leu ser ser leu arg 
CTC AGC TCT CTG AGG 



glu thr ile phe leu 
GAG ACC ATC TTT CTG 

380 

arg arg leu pro arg 
CGC AGG TTG CCC CGC 

leu phe leu glu leu 
CTG TTT CTG GAG CTG 

410 

val leu leu lys thr 
GTG CTC CTC AAG ACG 



ala ala gly val cys 
GCA GCC GGT GTC TGT 



thr arg his ser his 
ACG CGC CAC TCC CAC 

310 

pro pro ser thr ser 
CCC CCA TCC ACA TCG 



pro pro val tyr ala 
CCC CCG GTG TAC GCC 

340 

asp lys glu gin leu 
GAC AAG GAG CAG CTG 

pro ser leu thr gly 
CCC AGC CTG ACT GGC 

370 

gly ser arg pro trp 
GGT TCC AGG CCC TGG 



leu pro gin arg tyr 
CTG CCC CAG CGC TAC 

400 

leu gly asn his ala 
CTT GGG AAC CAC GCG 



his cys pro leu arg 
CAC TGC CCG CTG CGA 

430 

ala arg glu lys pro 
GCC CGG GAG AAG CCC 



300 

pro ser val gly arg 
CCA TCC GTG GGC CGC 



arg pro pro arg pro 
CGG CCA CCA CGT CCC 

330 

glu thr lys his phe 
GAG ACC AAG CAC TTC 



arg pro ser phe leu 
CGG CCC TCC TTC CTA 

360 

ala arg arg leu val 
GCT CGG AGG CTC GTG 



met pro gly thr pro 
ATG CCA GGG ACT CCC 

390 

trp gin met arg pro 
TGG CAA ATG CGG CCC 



gin cys pro tyr gly 
CAG TGC CCC TAC GGG 

420 

aia ala val thr pro 
GCT GCG GTC ACC CCA 



gin gly ser val ala 
CAG GGC TCT GTG GCG 
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440 4 50 
aia pre glu glu glu asp thr asp pro arg arg leu val gin leu 
GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG CAG CTG 

460 

l«u arg gin his ser ser pro trp gin val tyr gly phe val arg 
CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG CGG 

470 ^80 
ala cvs leu arg arg leu val pro pro gly leu trp gly ser arg 
GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 
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490 

his asn glu arcr arg phe leu arg asn thr lys lys phe ile ser 

CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC 

500 510 

leu gly lys his ala lys leu ser leu gin glu leu thr trp lys 

CTG GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG 

520 

met ser val arg asp cys ala trp leu arg arg ser pro gly val 

ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT 

. 530 540 

gly cys val pro ala ala glu his arg leu arg glu glu ile leu 

GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG 



550 

ala lys phe leu his trp leu met ser val tyr val val glu leu 
GCC AAC- TTC CTG CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG 

560 570 
leu arg ser phe phe tyr val thr glu thr thr phe gin lys asn 
CTC AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC 

580 

arg leu phe phe tyr arg pro ser val trp ser lys leu gin ser 
AGG CTC TTT TTC TAC CGG CCG AGT GTC TGG AGC AAG TTG CAA AGC 



590 600 
ile glv ile arg gin his leu lys arg val gin leu arg glu leu 
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG 
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ser glu ala glu val 
TCG GAA GCA GAG GTC 

620 

leu thr ser arg leu 
CTG ACG TCC AGA CTC 



pro ile val asn met 
CCG ATT C-TG AAC ATG 

650 

arg glu lys arg ala 
AGA GAA AAG AGG GCC 



phe ser val leu asn 
TTC AGC GTG CTC AAC 

680 

gly ala ser val leu 
GGC GCC TCT GTG CTG 

thr phe val leu arg 
ACC TTC GTG CTG CGT 

710 

tyr phe val lys val 
TAC TTT GTC AAG GTG 



gin asp arg leu thr 
CAG GAC AGG CTC ACG 

740 

asn thr tyr cys val 
AAC ACG TAC TGC GTG 



his gly his val arg 
CAT GGG CAC GTC CGC 



610 

arg gin his arg glu 
AGG CAG CAT CGG GAA 



arg phe ile pro lys 
CGC TTC ATC CCC AAG 

640 

asp tyr val val gly 
GAC TAC GTC GTG GGA 



glu arg leu thr ser 

GAG CGT CTC ACC TCG 

670 

tyr glu arg ala arg 

TAC GAG CGG GCG CGG 



gly leu asp asp ile 
GGC CTG GAC GAT ATC 

700 

val arg ala gin asp 
GTG CGG GCC CAG GAC 



asp val thr gly ala 
GAT GTG ACG GGC GCG 

720 

glu val ile ala ser 
GAG GTC ATC GCC AGC 



arg arg tyr ala val 
CGT CGG TAT GCC GTG 

760 

lys ala phe lys ser 
AAG GCC TTC AAG AGC 



ala arg pro ala leu 
GCC AGG CCC GCC CTG 

630 

pro asp gly leu arg 
CCT GAC GGG CTG CGG 



ala arg thr phe arg 
GCC AGA ACG TTC CGC 

660 

arg val lys ala leu 
AGG GTG AAG GCA CTG 



arg pro gly leu leu 
CGC CCC GGC CTC CTG 

690 

his arg ala trp arg 
CAC AGG GCC TGG CGC 



pro pro pro glu leu 
CCG CCG CCT GAG CTG 

720 

tyr asp thr ile pro 
TAC GAC ACC ATC CCC 



ile ile lys pro gin 
ATC ATC AAA CCC CAG 

750 

val gin lys ala ala 
GTC CAG AAG GCC GCC 



his val ser thr leu 
CAC GTC TCT ACC TTG 
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thr asp leu 
AC A GAC CTC 



glu thr ser 
GAG AGC AGC 



ser ieu asn 
TCC CTC- AAT 



phe met cys 

TTC ATG TGC 



gin cys gin 
CAG TGC CAG 



cys ser ieu 
TGC AGC' CTG 



ile arg arg 
ATT CGG CGG 



leu val thr 
TTG GTG ACA 



leu val arg 
CTG GTC CGA 



lys thr val 
AAG ACA GTG 



thr a la phe 
ACG GCT TTT 



770 

gin pro tyr 
CAG CCG TAC 



pro leu arg 
CCG CTG AGG 

800 

glu ala ser 
GAG GCC AGC 



his his ala 
CAC CAC GCC 

830 

gly ile pro 
GGG ATC CCG 

cys tyr gly 
TGC TAC GGC 

860 

asp gly leu 
GAC GGG CTG 



pro his leu 
CCT CAC CTC 

890 

gly val pro 
GGT GTC CCT 



val asn phe 
GTG AAC TTC 

920 

val gin met 
GTT CAG ATG 



met arg gin 
ATG CGA CAG 

790 

asp ala val 
GAT GCC GTC 

ser gly leu 
AGT GGC CTC 

820 

val arg ile 
GTG CGC ATC 



gin gly ser 
CAG GGC TCC 

850 

asp met glu 
GAC ATG GAG 



leu leu arg 
CTC CTG CGT 

880 

thr his ala 
ACC CAC GCG 



glu tyr gly 
GAG TAT GGC 

910 

pro val glu 
CCT GTA GAA 



pro ala his 
CCG GCC CAC 



phe val ala 
TTC GTG GCT 



val ile glu 
GTC ATC GAG 



phe asp val 
TTC GAC GTC 



arg gly lys 
AGG GGC AAG 



ile leu ser 
ATC CTC TCC 



asn lys leu 
AAC AAG CTG 



leu val asp 
TTG GTG GAT 



lys thr phe 
AAA ACC TTC 



cys val val 
TGC GTG GTG 



asp glu ala 
GAC GAG GCC 



gly leu phe 
GGC CTA TTC 



780 
his leu gin 
CAC CTG CAG 



gin ser ser 
CAG AGC TCC 

810 
phe leu arg 
TTC CTA CGC 



ser tyr val 
TCC TAC GTC 

840 
thr ieu ieu 
ACG CTG CTC 



phe ala gly 
TTT GCG GGG 

870 
asp phe leu 
GAT TTC TTG 

leu arg thr 
CTC AGG ACC 

900 
asn leu arg 
AAC TTG CGG 



leu gly gly 
CTG GGT GGC 

930 

pro trp cys 
CCC TGG TGC 
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940 

gly leu leu leu asp thr arg thr leu glu val gin ser asp tyr 

GGC CTG CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC 

950 960 

ser ser tyr ala arg thr ser ile arg ala ser val thr phe asn 

TCC'AGC TAT GCC CGG ACC TCC ATC AGA GCC AGT GTC ACC TTC AAC 

970 

arg gly phe lys ala gly arg asn met arg arg lys leu phe gly 

CGC GGC TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG 

980 990 

val leu arg leu lys cys his ser leu phe leu asp leu gin val 

GTC TTG CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG 

1000 

asn ser leu gin thr val cys thr asn ile tyr lys ile leu leu 

AAC AGC CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG 

1010 1020 

leu gin ala tyr arg phe his ala cys val leu gin leu pro phe 

CTG CAG GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT 

1030 

his gin gin val trp lys asn pro thr phe phe leu arg val ile 

CAT CAG CAA GTT TGG AAG AAC CCC ACA TTT TTC CTG CGC GTC ATC 

1040 1050 

ser asp zhr ala ser leu cys tyr ser ile leu lys ala lys asn 

. TCT GAC ACG GCC TCC CTC TGC TAC TCC ATC CTG AAA GCC AAG AAC 



ala gly met ser leu 
GCA GGG ATG TCG CTG 

1070 

ser glu ala val gin 
TCC GAG GCC GTG CAG 



1060 

gly ala lys gly ala 
GGG GCC AAG GGC GCC 



trp leu cys his gin 
TGG CTG TGC CAC CAA 



ala gly pro leu pro 
GCC GGC CCT CTG CCC 

1080 

ala phe leu leu lys 
GCA TTC CTG CTC AAG 



1090 

leu thr arg his arg val thr tyr val pro leu leu gly ser leu 
CTG ACT CGA CAC CGT GTC ACC TAC GTG CCA CTC CTG GGG TCA CTC 
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1100 mo 

arg thr ala gin thr gin leu ser arg lys leu pro gly thr thr 
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 

1120 

leu thr ala leu glu ala ala ala asn pro ala leu pro ser asp 
CTG AC? GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 

1130 1132 
phe lys thr ile leu asp OP 

TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA 

CACCAC-CAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 

AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 

GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 

CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 

CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 

CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 

GACCCTCAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA 

CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC 

TGTGGGAGTAAJLATACTvSAATATATGAGTTTTTCAGTTTTGRAAAJ^AAAAAAAAAAAAAA 

AAAAAAAAAA 
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